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(vii) PRIOR APPLICATION DATA: 



(A) APPLICATION NUMBER: 
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(B) FILING DATE: 12-JAN-1996 

(viii) : ATTORNEY /AGENT INFORMATION: 

(A) NAME: MASS , Clifford J. 

(B) REGISTRATION NO. : 30,086 

(C) REF . /DOCKET NO . : U-011098-6 

(ix) TELECOMMUNICATION INFORMATION: 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 25.. 2485 

(D) OTHER INFORMATION: /function= "trehalose phosph. 
synthase and trehalose phosph. phosphatase" 
/product= "bipartite enzyme" 



(ix) FEATURE: 

(A) NAME / KEY : unsure 

(B) LOCATION: 1609.. 1611 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CTGATCCTGC GGTTTCATCA CAAT ATG ATA CTC TTA CAT CTG ATG CCC CTT 51 

Met lie Leu Leu His Leu Met Pro Leu 
1 5 

CAG ATG CTC CCA AAT AGG TTG ATT GTC GTA TCG AAT CAG TTA CCC ATA 99 
Gin Met Leu Pro Asn Arg Leu lie Val Val Ser Asn Gin Leu Pro lie 
10 15 20 25 

ATC GCT AGG CTA AGA CTA ACG ACA ATG GAG GGT CCT TTT GGG ATT TCA 14 7 

lie Ala Arg Leu Arg Leu Thr Thr Met Glu Gly Pro Phe Gly lie Ser 
30 35 40 

CTT GGG ACG AGA GTT CGA TTT ACA TGC ACA TCA AAG ATG CAT TAC CCG 195 
Leu Gly Thr Arg Val Arg Phe Thr Cys Thr Ser Lys Met His Tyr Pro 
45 50 55 

CAG CCG TTG AGG TTT TCT ATT CTT GGC GAT CCA CTA AGG GCT GAC GTT 243 
Gin Pro Leu Arg Phe Ser lie Leu Gly Asp Pro Leu Arg Ala Asp Val ' 
60 65 70 

GGC CCT ACC GAA CAA GAT GAC GTG TCA AAG ACA TTG CTC GAT AGG TTT 2 91 

Gly Pro Thr Glu Gin Asp Asp Val Ser Lys Thr Leu Leu Asp Arg Phe 
75 80 85 

AAT TGC GTT GCG GTT TTT GTC CCT ACT TCA AAA TGG GAC CAA TAT TAT 339 
Asn Cys Val Ala Val Phe Val Pro Thr Ser Lys Trp Asp Gin Tyr Tyr 
90 95 100 105 

CAC TGC TTT TGT AAG CAG TAT TTG TGG CCG ATA TTT CAT TAC AAG GTT 387 
His Cys Phe Cys Lys Gin Tyr Leu Trp Pro lie Phe His Tyr Lys Val 
110 115 120 

CCC GCT TCT GAC GTC AAG AGT GTC CCG AAT AGT CGG GAT TCA TGG AAC 43 5 

Pro Ala Ser Asp Val Lys Ser Val Pro Asn Ser Arg Asp Ser Trp Asn 
125 130 135 

GCT TAT GTT CAC GTG AAC AAA GAG TTT TCC CAG AAG GTG ATG GAG GCA 483 
Ala Tyr Val His Val Asn Lys Glu Phe Ser Gin Lys Val Met Glu Ala 
140 145 150 

GTA ACC AAT CGT AGC AAT TAT GTA TGG ATA CAT GAC TAC CAT TTA ATG 531 
Val Thr Asn Arg Ser Asn Tyr Val Trp lie Hie Asp Tyr His Leu Met 
155 160 165 

ACG CTA CCG ACT TTC TTG AGG CGG GAT TTT TGT CGT TTT AAA ATC GGT 579 
Thr Leu Pro Thr Phe Leu Arg Arg Asp Phe Cys Arg Phe Lys lie Gly 
170 175 180 185 

TTT TTT CTG CAT AGC CCG TTT CCT TCC TCG GAG GTT TAC AAG ACC CTA 62 7 

Phe Phe Leu His Ser Pro Phe Pro Ser Ser Glu Val Tyr Lys Thr Leu 
190 195 200 



^ J 



CCA ATG AGA AAC GAG CTC TTG AAG GGT CTG TTA AAT GCT GAT CTT ATC 6 75 

Pro Met Arg Asn Glu Leu Leu Lys Gly Leu Leu Asn Ala Asp Leu He 
205 210 215 

GGG TTC CAT ACA TAC GAT TAT GCC CGT CAT TTT CTA ACG TGT TGT AGT 72 3 

Gly Phe His Thr Tyr Asp Tyr Ala Arg His Phe Leu Thr Cys Cys Ser 
220 225 230 

CGA ATG TTT GGT TTG GAT CAT CAG TTG AAA AGG GGG TAC ATT TTC TTG 771 
Arg Met Phe Gly Leu Asp His Gin Leu Lys Arg Gly Tyr He Phe Leu 
235 240 245 

GAA TAT AAT GGA AGG AGC ATT GAG ATC AAG ATA AAG GCG AGC GGG ATT 819 
Glu Tyr Asn Gly Arg Ser He Glu He Lys He Lys Ala Ser Gly He 
250 255 260 265 

CAT GTT GGT CGA ATG GAG TCG TAC TTG AGT CAG CCC GAT ACA AGA TTA 86 7 

His Val Gly Arg Met Glu Ser Tyr Leu Ser Gin Pro Asp Thr Arg Leu 
270 275 280 

CAA GTT CAA GAA GTC CAA AAA CGT TCG AAG GAA ATC GTG CTA CTG GGA 915 
Gin Val Gin Glu Val Gin Lys Arg Ser Lys Glu He Val Leu Leu Gly 
285 290 295 

GTT GAT GAT TTG GAT ATA TTC AAA GGT GTG AAC TTC AAG GTT TTA GCG 963 
Val Asp Asp Leu Asp He Phe Lys Gly Val Asn Phe Lys Val Leu Ala 
300 305 310 

TTG GAG AAG TTA CTT AAA TCA CAC CCG AGT TGG CAA GGG CGT GTG GAA 1011 
Leu Glu Lys Leu Leu Lys Ser His Pro Ser Trp Gin Gly Arg Val Glu 
315 320 325 

AAG GTG CAA ATC TTG AAT CCT CTG CGC CGT TGC CAA GAC GTC GAT GAG 1059 
Lys Val Gin He Leu Asn Pro Leu Arg Arg Cys Gin Asp Val Asp Glu 
330 335 340 345 

ATC AAT GCC GAG ATA AGA ACA GTC TGT GAA AGA ATC AAT AAC GAA CTG 1107 
He Asn Ala Glu He Arg Thr Val Cys Glu Arg He Asn Asn Glu Leu 
350 355 360 

GGA AGC CCG GGA TAC CAG CCC GTT GTG TTA ATT GAT GGG CCC GTT TCG 1155 
Gly Ser Pro Gly Tyr Gin Pro Val Val Leu He Asp Gly Pro Val Ser 
365 370 375 

TTA AGT GAA AAA GCT GCT TAT TAT GCT ATC GCC GAT ATG GCA ATT GTT 1203 
Leu Ser Glu Lys Ala Ala Tyr Tyr Ala He Ala Asp Met Ala He Val 
380 385 390 

ACA CCG TTA CGT GAC GGA CTG AAT CTT ATC CCG TAC GAG TAC GTC GTT 1251 
Thr Pro Leu Arg Asp Gly Leu Asn Leu He Pro Tyr Glu Tyr Val Val 
395 400 405 



TCC CGA CAA AGT GTT AAT GAC CCA AAT CCC AAT ACT CCA AAA AAG AGC 1299 
Ser Arg Gin Ser Val Asn Asp Pro Asn Pro Asn Thr Pro Lys Lys Ser 
410 415 420 425 

ATG CTA GTG GTC TCC GAG TTC ATC GGT GTT TCA CTA TCT TTA ACC GGG 134 7 

Met Leu Val Val Ser Glu Phe lie Gly Val Ser Leu Ser Leu Thr Gly 
430 435 440 

GCC ATA CGG GTC AAC CCA TGG GAT GAG TTG GAG ACA GCA GAA GCA TTA 13 95 

Ala lie Arg Val Asn Pro Trp Asp Glu Leu Glu Thr Ala Glu Ala Leu 
445 450 455 

TAC GAC GCA CTC ATG GCT CCT GAT GAC CAT AAA GAA ACC GCC CAC ATG 1443 
Tyr Asp Ala Leu Met Ala Pro Asp Asp His Lys Glu Thr Ala His Met 
460 465 470 

AAA CAG TAT CAA TAC ATT ATC TCC CAT GAT GTA GCT AAC TGG GCT AGC 1491 
Lys Gin Tyr Gin Tyr lie lie Ser His Asp Val Ala Asn Trp Ala Ser 
475 480 485 

TTC TTT CAA GAT TTA GAG CAA GCG TGC ATC GAT CAT TCT CGT AAA CGA 153 9 

Phe Phe Gin Asp Leu Glu Gin Ala Cys He Asp His Ser Arg Lys Arg 
490 495 500 505 

TGC ATG AAT TTA GGA TTT GGG TTA GAT ACT AGA GTC GTC TTT TTG ATG 1587 
Cys Met Asn Leu Gly Phe Gly Leu Asp Thr Arg Val Val Phe Leu Met 
510 515 520 

AGA AGT TTA GCA AGT TGG ATA AAG ATG TCT TGG AAG AAT GCT TAT TCC 1635 
Arg Ser Leu Ala Ser Trp He Lys Met Ser Trp Lys Asn Ala Tyr Ser 
525 530 535 

ATG GCT CAA AAT CGG GCC ATA CTT TTG GAC TAT GAC GGC ACT GTT ACT 16 8 3 

Met Ala Gin Asn Arg Ala He Leu Leu Asp Tyr Asp Gly Thr Val Thr 
540 545 550 

CCA TCT ATC AGT AAA TCT CCA ACT GAA GCT GTT ATC TCC ATG ATC AAC 1731 
Pro Ser He Ser Lys Ser Pro Thr Glu Ala Val He Ser Met He Asn 
555 560 565 

AAA CTG TGC AAT GAT CCA AAG AAC ATG GTG TTC ATC GTT AGT GGA CGC 17 7 9 

Lys Leu Cys Asn Asp Pro Lys Asn Met Val Phe He Val Ser Gly Arg 
570 575 580 585 

4 

AGT AGA GAG AAA ATC TTG GCA GTT GGT TCG GCG CGT GTG AGA ACC CGC 182 7 

Ser Arg Glu Lys He Leu Ala Val Gly Ser Ala Arg Val Arg Thr Arg 
590 595 600 

CAT TGC ACT GAG CAC GGA TAC TTT ATA AGG TGG GCG GGT GAT CAA GAA 18 75 

His Cys Thr Glu His Gly Tyr Phe He Arg Trp Ala Gly Asp Gin Glu 
605 610 615 



TGG GAA ACG TGC GCA CGT GAG AAT AAT GTC GGG TGG ATG GAT GGA AAT 1923 
Trp Glu Thr Cys Ala Arg Glu Asn Asn Val Gly Trp Met Asp Gly Asn 
620 625 630 

CTG AGG CCG GTT ATG AAT CTT TAT ACA GAA ACT ACT GAC GGT TCG TAT 1971 
Leu Arg Pro Val Met Asn Leu Tyr Thr Glu Thr Thr Asp Gly Ser Tyr 
635 640 645 

ATT GAA AAG AAA GAA ACT GCA ATG GTT TGG CAC TAT GAA GAT GCT GAT 2019 
He Glu Lys Lys Glu Thr Ala Met Val Trp His Tyr Glu Asp Ala Asp 
650 655 660 665 

AAA GAT CTT GGG TTG GAG CAG GCT AAG GAA CTG TTG GAC CAT CTT GAA 2067 
Lys Asp Leu Gly Leu Glu Gin Ala Lys Glu Leu Leu Asp His Leu Glu 
670 675 680 

AAC GTG CTC GCT AAT GAG CCC GTT GGA GTG AAT CGA ACA GGT CAA TAC 2115 
Asn Val Leu Ala Asn Glu Pro Val Gly Val Asn Arg Thr Gly Gin Tyr 
685 690 695 

ATT GTA GAA GTT AAA CCA CAG TCC CCC ATT AAT TAC CTT CTT GTT ATG 2163 
He Val Glu Val Lys Pro Gin Ser Pro He Asn Tyr Leu Leu Val Met 
700 705 710 

ACA TTC ATA GGC ACT GAT TGT AGA ATC TTT AAC TTA AAT TTC TTT AAA 2211 
Thr Phe He Gly Thr Asp Cys Arg He Phe Asn Leu Asn Phe Phe Lys 
715 720 725 

TAT GAA TGC AAT TAT AGG GGG TCA CTA AAA GGT ATA GTT GCA GAG AAG 2259 
Tyr Glu Cys Asn Tyr Arg Gly Ser Leu Lys Gly lie Val Ala Glu Lys 
730 735 740 745 

ATT TTT GCG TTC ATG GCT AAA AAG GGA AAA CAG GCT GAT TTC GTG TTG 23 07 

He Phe Ala Phe Met Ala Lys Lys Gly Lys Gin Ala' Asp Phe Val Leu 
750 755 760 

ACG TTG AAT GAT AGA AGT GAT GAA GAC ATG TTT GTG GCC ATT GGG GAT 23 55 

Thr Leu Asn Asp Arg Ser Asp Glu Asp Met Phe Val Ala He Gly Asp 
765 770 775 

GGA ATA AAA AAG GGT CGG ATA ACT AAC AAC AAT TCA GTG TTT ACA TGC 2403 
Gly He Lys Lys Gly Arg He Thr Asn Asn Asn Ser Val Phe Thr Cys 
780 785 790 

GTA GTG GGA GAG AAA CCG AGT GCA GCT GAG TAC TTT TTA AAT GAT GTC 2451 
Val Val Gly Glu Lys Pro Ser Ala Ala Glu Tyr Phe Leu Asn Asp Val 
795 800 805 

TCG AGA AGC TCC GGG TGT CTC AGC AAC CAA GGA T GATCCGGAAG 24 95 

Ser Arg Ser Ser Gly Cys Leu Ser Asn Gin Gly 
810 815 820 

CTTCTCGTGA TCTTTATGAG TTAAAAGTTT TCGACTTTTT CTTCATCAAG ATTCATGGGA 2555 



AAGTTGTTCA ATATGAACTT GTGTTCTTGG TTCTGGATTT TAGGGAGTCT ATGGATATAA 2615 



CATTTC 2621 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met lie Leu Leu His Leu Met Pro Leu Gin Met Leu Pro Asn Arg Leu 
15 10 15 

lie Val Val Ser Asn Gin Leu Pro lie lie Ala Arg Leu Arg Leu Thr 
20 25 30 

Thr Met Glu Gly Pro Phe Gly lie Ser Leu Gly Thr Arg Val Arg Phe 
35 40 45 

Thr Cys Thr Ser Lys Met His Tyr Pro Gin Pro Leu Arg Phe Ser lie 
50 55 60 

Leu Gly Asp Pro Leu Arg Ala Asp Val Gly Pro Thr Glu Gin Asp Asp 
65 70 75 80 

Val Ser Lys Thr Leu Leu Asp Arg Phe Asn Cys Val Ala Val Phe Val 
85 90 95 

Pro Thr Ser Lys Trp Asp Gin Tyr Tyr His Cys Phe Cys Lys Gin Tyr 
100 105 110 

Leu Trp Pro lie Phe His Tyr Lys Val Pro Ala Ser Asp Val Lys Ser 
115 120 125 

Val Pro Asn Ser Arg Asp Ser Trp Asn Ala Tyr Val His Val Asn Lys 
130 135 140 

Glu Phe Ser Gin Lys Val Met Glu Ala Val Thr Asn Arg Ser Asn Tyr 
145 150 155 160 

Val Trp lie His Asp Tyr His Leu Met Thr Leu Pro Thr Phe Leu Arg 
165 170 175 

Arg Asp Phe Cys Arg Phe Lys lie Gly Phe Phe Leu His Ser Pro Phe 
180 185 190 

Pro Ser Ser Glu Val Tyr Lys Thr Leu Pro Met Arg Asn Glu Leu Leu 
195 200 205 



Lys Gly Leu Leu Asn Ala Asp Leu He Gly Phe His Thr Tyr Asp Tyr 
210 215 220 



Ala Arg His Phe 
225 

Gin Leu Lys Arg 



Glu He Lys He 
260 

Tyr Leu Ser Gin 
275 

Arg Ser Lys Glu 
290 



Leu Thr Cys Cys 
230 

Gly Tyr He Phe 
245 

Lys Ala Ser Gly 



Pro Asp Thr Arg 
280 

He Val Leu Leu 
295 



Ser Arg Met Phe 
235 

Leu Glu Tyr Asn 
250 

He His Val Gly 
265 

Leu Gin Val Gin 



Gly Val Asp Asp 
300 



Gly Leu Asp His 
240 

Gly Arg Ser He 
255 

Arg Met Glu Ser 
270 

Glu Val Gin Lys 
285 

Leu Asp He Phe 



Lys Gly Val Asn Phe Lys Val Leu Ala Leu Glu Lys Leu Leu Lys Ser 
305 310 315 320 

His Pro Ser Trp Gin Gly Arg Val Glu Lys Val Gin He Leu Asn Pro 
325 330 335 

Leu Arg Arg Cys Gin Asp Val Asp Glu lie Asn Ala Glu He Arg Thr 
340 345 350 

Val Cys Glu Arg He Asn Asn Glu Leu Gly Ser Pro Gly Tyr Gin Pro 
355 360 365 

Val Val Leu He Asp Gly Pro Val Ser Leu Ser Glu Lys Ala Ala Tyr 
370 375 380 

Tyr Ala He Ala Asp Met Ala He Val Thr Pro Leu Arg Asp Gly Leu 
385 390 395 400 

Asn Leu He Pro Tyr Glu Tyr Val Val Ser Arg Gin Ser Val Asn Asp 
405 410 415 

Pro Asn Pro Asn Thr Pro Lys Lys Ser Met Leu Val Val Ser Glu Phe 
420 425 430 

He Gly Val Ser Leu Ser Leu Thr Gly Ala He Arg Val Asn Pro Trp 
435 440 * 445 

Asp Glu Leu Glu Thr Ala Glu Ala Leu Tyr Asp Ala Leu Met Ala Pro 
450 455 460 

Asp Asp His Lys Glu Thr Ala His Met Lys Gin Tyr Gin Tyr He He 
465 470 475 480 

Ser His Asp Val Ala Asn Trp Ala Ser Phe Phe Gin Asp Leu Glu Gin 
485 490 495 



Ala Cys lie Asp His 
500 

Leu Asp Thr Arg Val 
515 

Lys Met Ser Trp Lys 
530 

Leu Leu Asp Tyr Asp 
545 

Thr Glu Ala Val lie 
565 

Asn Met Val Phe lie 
580 

Val Gly Ser Ala Arg 
595 

Phe lie Arg Trp Ala 
610 

Asn Asn Val Gly Trp 
625 

Tyr Thr Glu Thr Thr 
645 

Met Val Trp His Tyr 
660 

Ala Lys Glu Leu Leu 
675 

Val Gly Val Asn Arg 
690 

Ser Pro lie Asn Tyr 
705 

Arg lie Phe Asn Leu 
725 

Ser Leu Lys Gly lie 
740 

Lys Gly Lys Gin Ala 
755 

Glu Asp Met Phe Val 
770 



Ser Arg Lys Arg Cys Met 
505 

Val Phe Leu Met Arg Ser 
520 

Asn Ala Tyr Ser Met Ala 
535 

Gly Thr Val Thr Pro Ser 
550 555 

Ser Met lie Asn Lys Leu 
570 

Val Ser Gly Arg Ser Arg 
585 

Val Arg Thr Arg His Cys 
600 

Gly Asp Gin Glu Trp Glu 
615 

Met Asp Gly Asn Leu Arg 
630 635 

Asp Gly Ser Tyr lie Glu 
650 

Glu Asp Ala Asp Lys Asp 
665 

Asp His Leu Glu Asn Val 
680 

Thr Gly Gin Tyr lie Val 
695 

Leu Leu Val Met Thr Phe 
710 715 

Asn Phe Phe Lys Tyr Glu 
730 

Val Ala Glu Lys lie Phe 
745 

Asp Phe Val Leu Thr Leu 
760 

Ala He Gly Asp Gly He 
775 



Asn Leu Gly Phe Gly 
510 

Leu Ala Ser Trp He 
525 

Gin Asn Arg Ala He 
540 

He Ser Lys Ser Pro 
560 

Cys Asn Asp Pro Lys 
575 

Glu Lys He Leu Ala 
590 

Thr Glu His Gly Tyr 
605 

Thr Cys Ala Arg Glu 
620 

Pro Val Met Asn Leu 
640 

Lys Lys Glu Thr Ala 
655 

Leu Gly Leu Glu Gin 
670 

Leu Ala Asn Glu Pro 
685 

Glu Val Lys Pro Gin 
700 

He Gly Thr Asp Cys 
720 

Cys Asn Tyr Arg Gly 
735 

Ala Phe Met Ala Lys 
750 

Asn Asp Arg Ser Asp 
765 

Lys Lys Gly Arg He 
780 




Thr Asn Asn Asn 
785 



Ser Val Phe Thr Cys 
790 



Val Val Gly Glu Lys Pro Ser 
795 800 



Ala Ala Glu Tyr 



Phe Leu Asn Asp Val 
805 



Ser Arg Ser Ser Gly Cys Leu 
810 815 



Ser Asn Gin Gly 
820 



(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AAGCTTATGT TGCCATATAG AGTAG 2 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GTAGTTGCCA TGGTGCAAAT GTTC 



24 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
AGCTCTGCAG TGAGGT AC C A 
(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
GACGTCACTC CATGGTTCGA 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

4 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
GTACCCTGCA GTGTGACCCT AGAC 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TCGATTCATA GAAGCTTAGA T 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solanum tuberosum 

(B) STRAIN: Kardal 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 161.. 1906 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: 842.. 850 

(D) OTHER INFORMATION: /function^ "putative 
glycosylat ions i te " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CTTTTCTGAG TAATAACATA GGCATTGATT TTTTTTCAAT TAATAACACC TGCAAACATT 
CCCATTGCCG GCATTCTCTG TTCTTACAAA AAAAAACATT TTTTTGTTCA CATAAATTAG 



TTATGGCATC AGTATTGAAC CCTTTAACTT GTTATACAAT ATG GGT AAA GCT ATA 175 

Met Gly Lys Ala lie 
1 5 

ATT TTT ATG ATT TTT ACT ATG TCT ATG AAT ATG ATT AAA GCT GAA ACT 223 
lie Phe Met lie Phe Thr Met Ser Met Asn Met lie Lys Ala Glu Thr 
10 15 20 

TGC AAA TCC ATT GAT AAG GGT CCT GTA ATC CCA ACA ACC CCT TTA GTG 271 
Cys Lys Ser lie Asp Lys Gly Pro Val lie Pro Thr Thr Pro Leu Val 
25 30 35 

ATT TTT CTT GAA AAA GTT CAA GAA GCT GCT CTT CAA ACT TAT GGC CAT 319 
He Phe Leu Glu Lys Val Gin Glu Ala Ala Leu Gin Thr Tyr Gly His 
40 45 50 

AAA GGG TTT GAT GCT AAA CTG TTT GTT GAT ATG TCA CTG AGA GAG AGT 367 

Lys Gly Phe Asp Ala Lys Leu Phe Val Asp Met Ser Leu Arg Glu Ser 
55 60 65 

f 

CTT TCA GAA ACA GTT GAA GCT TTT AAT AAG CTT CCA AGA GTT GTG AAT 415 

Leu Ser Glu Thr Val Glu Ala Phe Asn Lys Leu Pro Arg Val Val Asn 
70 75 80 85 

GGT TCA ATA TCA AAA AGT GAT TTG GAT GGT TTT ATA GGT AGT TAC TTG 463 
Gly Ser He Ser Lys Ser Asp Leu Asp Gly Phe He Gly Ser Tyr Leu 
90 95 100 

AGT AGT CCT GAT AAG GAT TTG GTT TAT GTT GAG CCT ATG GAT TTT GTG 511 
Ser Ser Pro Asp Lys Asp Leu Val Tyr Val Glu Pro Met Asp Phe Val 
105 110 115 

GCT GAG CCT GAA GGC TTT TTG CCA AAG GTG AAG AAT TCT GAG GTG AGG 559 
Ala Glu Pro Glu Gly Phe Leu Pro Lys Val Lys Asn Ser Glu Val Arg 
120 125 130 

GCA TGG GCA TTG GAG GTG CAT TCA CTT TGG AAG AAT TTA AGT AGG AAA 6 07 

Ala Trp Ala Leu Glu Val His Ser Leu Trp Lys Asn Leu Ser Arg Lys 
135 140 145 

GTG GCT GAT CAT GTA TTG GAA AAA CCA GAG TTG TAT ACT TTG CTT CCA 655 
Val Ala Asp His Val Leu Glu Lys Pro Glu Leu Tyr Thr Leu Leu Pro 
150 155 160 165 

TTG AAA AAT CCA GTT ATT ATA CCG GGA TCG CGT TTT AAG GAG GTT TAT 703 
Leu Lys Asn Pro Val He He Pro Gly Ser Arg Phe Lys Glu Val Tyr 
170 175 180 

TAT TGG GAT TCT TAT TGG GTA ATA AGG GGT TTG TTA GCA AGC AAA ATG 751 
Tyr Trp Asp Ser Tyr Trp Val lie Arg Gly Leu Leu Ala Ser Lys Met 
185 190 195 



TAT GAA ACT GCA AAA GGG ATT GTG ACT AAT CTG GTT TCT CTG ATA GAT 7 99 

Tyr Glu Thr Ala Lys Gly lie Val Thr Asn Leu Val Ser Leu lie Asp 
200 205 210 

CAA TTT GGT TAT GTT CTT AAC GGT GCA AGA GCA TAC TAC AGT AAC AGA 84 7 

Gin Phe Gly Tyr Val Leu Asn Gly Ala Arg Ala Tyr Tyr Ser Asn Arg 
215 220 225 

AGT CAG CCT CCT GTC CTG GCC ACG ATG ATT GTT GAC ATA TTC AAT CAG 8 95 

Ser Gin Pro Pro Val Leu Ala Thr Met lie Val Asp lie Phe Asn Gin 
230 235 240 245 

ACA GGT GAT TTA AAT TTG GTT AGA AGA TCC CTT CCT GCT TTG CTC AAG 943 
Thr Gly Asp Leu Asn Leu Val Arg Arg Ser Leu Pro Ala Leu Leu Lys 
250 255 260 

GAG AAT CAT TTT TGG AAT TCA GGA ATA CAT AAG GTG ACT ATT CAA GAT 991 
Glu Asn His Phe Trp Asn Ser Gly lie His Lys Val Thr lie Gin Asp 
265 270 275 

GCT CAG GGA TCA AAC CAC AGC TTG AGT CGG TAC TAT GCT ATG TGG AAT 103 9 

Ala Gin Gly Ser Asn His Ser Leu Ser Arg Tyr Tyr Ala Met Trp Asn 
280 285 290 

AAG CCC CGT CCA GAA TCG TCA ACT ATA GAC AGT GAA ACA GCT TCC GTA 1087 
Lys Pro Arg Pro Glu Ser Ser Thr lie Asp Ser Glu Thr Ala Ser Val 
295 300 305 

CTC CCA AAT ATA TGT GAA AAA AGA GAA TTA TAC CGT GAA CTG GCA TCA 1135 
Leu Pro Asn lie Cys Glu Lys Arg Glu Leu Tyr Arg Glu Leu Ala Ser 
310 315 320 325 

GCT GCT GAA AGT GGA TGG GAT TTC AGT TCA AGA TGG ATG AGC AAC GGA 1183 
Ala Ala Glu Ser Gly Trp Asp Phe Ser Ser Arg Trp Met Ser Asn Gly 
330 335 340 

TCT GAT CTG ACA ACA ACT AGT ACA ACA TCA ATT CTA CCA GTT GAT TTG 1231 
Ser Asp Leu Thr Thr Thr Ser Thr Thr Ser lie Leu Pro Val Asp Leu 
345 350 355 

AAT GCA TTC CTT CTG AAG ATG GAA CTT GAC ATT GCC TTT CTA GCA AAT 127 9 

Asn Ala Phe Leu Leu Lys Met Glu Leu Asp lie Ala Phe Leu Ala Asn 
360 365 370 

CTT GTT GGA GAA AGT AGC ACG GCT TCA CAT TTT ACA GAA GCT GCT CAA 132 7 

Leu Val Gly Glu Ser Ser Thr Ala Ser His Phe Thr Glu Ala Ala Gin 
375 380 385 

AAT AGA CAG AAG GCT ATA AAC TGT ATC TTT TGG AAC GCA GAG ATG GGG 13 7 5 

Asn Arg Gin Lys Ala lie Asn Cys He Phe Trp Asn Ala Glu Met Gly 
390 395 400 405 



CAA TGG CTT GAT TAC TGG CTT ACC AAC AGC GAC ACA TCT GAG GAT ATT 
Gin Trp Leu Asp Tyr Trp Leu Thr Asn Ser Asp Thr Ser Glu Asp lie 
410 415 420 



1423 



TAT AAA TGG GAA GAT TTG CAC CAG AAC AAG AAG TCA TTT GCC TCT AAT 1471 
Tyr Lys Trp Glu Asp Leu His Gin Asn Lys Lys Ser Phe Ala Ser Asn 
425 430 435 

TTT GTT CCG CTG TGG ACT GAA ATT TCT TGT TCA GAT AAT AAT ATC ACA 1519 
Phe Val Pro Leu Trp Thr Glu lie Ser Cys Ser Asp Asn Asn lie Thr 
440 445 450 

ACT CAG AAA GTA GTT CAA AGT CTC ATG AGC TCG GGC TTG CTT CAG CCT 156 7 

Thr Gin Lys Val Val Gin Ser Leu Met Ser Ser Gly Leu Leu Gin Pro 
455 460 465 

GCA GGG ATT GCA ATG ACC TTG TCT AAT ACT GGA CAG CAA TGG GAT TTT 1615 
Ala Gly lie Ala Met Thr Leu Ser Asn Thr Gly Gin Gin Trp Asp Phe 
470 475 480 485 

t 

CCG AAT GGT TGG CCC CCC CTT CAA CAC ATA ATC ATT GAA GGT CTC TTA 166 3 

Pro Asn Gly Trp Pro Pro Leu Gin His lie lie lie Glu Gly Leu Leu 
490 495 500 

AGG TCT GGA CTA GAA GAG GCA AGA ACC TTA GCA AAA GAC ATT GCT ATT 1711 
Arg Ser Gly Leu Glu Glu Ala Arg Thr Leu Ala Lys Asp He Ala He 
505 510 515 

CGC TGG TTA AGA ACT AAC TAT GTG ACT TAC AAG AAA ACC GGT GCT ATG 17 59 

Arg Trp Leu Arg Thr Asn Tyr Val Thr Tyr Lys Lys Thr Gly Ala Met 
520 525 530 

TAT GAA AAA TAT GAT GTC ACA AAA TGT GGA GCA TAT GGA GGT GGT GGT 1807 
Tyr Glu Lys Tyr Asp Val Thr Lys Cys Gly Ala Tyr Gly Gly Gly Gly 
535 540 545 

GAA TAT ATG TCC CAA ACG GGT TTC GGA TGG TCA AAT GGC GTT GTA CTG 18 55 

Glu Tyr Met Ser Gin Thr Gly Phe Gly Trp Ser Asn Gly Val Val Leu 
550 555 560 565 

GCA CTT CTA GAG GAA TTT GGA TGG CCT GAA GAT TTG AAG ATT GAT TGC 1903 
Ala Leu Leu Glu Glu Phe Gly Trp Pro Glu Asp Leu Lys He Asp Cys 
570 575 580 

TAATGAGCAA GTAGAAAAGC CAAATGAAAC ATCATTGAGT TTTATTTTCT TCTTTTGTTA 1963 

AAATAAGCTG CAATGGTTTG CTGATAGTTT ATGTTTTGTA TTACTATTTC ATAAGGTTTT 2 02 3 

TGTACCATAT CAAGTGATAT TACCATGAAC TATGTCGTTC GGACTCTTCA AATCGGATTT 2 083 

TGCAAAAATA ATGCAGTTTT GGAGAATCCG ATAACATAGA CCATGTATGG ATCTAAATTG 214 3 

TAAACAGCTT ACTATATTAA GTAAAAGAAA GATGATTCCT CTGCTTTAAA AAAAAAAAAA 22 03 



AAAA 



2207 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Gly Lys Ala lie lie Phe Met lie Phe Thr Met Ser Met Asn Met 
15 10 15 

lie Lys Ala Glu Thr Cys Lys Ser lie Asp Lys Gly Pro Val lie Pro 
20 25 30 

Thr Thr Pro Leu Val lie Phe Leu Glu Lys Val Gin Glu Ala Ala Leu 
35 40 45 

Gin Thr Tyr Gly His Lys Gly Phe Asp Ala Lys Leu Phe Val Asp Met 
50 55 60 

Ser Leu Arg Glu Ser Leu Ser Glu Thr Val Glu Ala Phe Asn Lys Leu 
65 70 75 80 

Pro Arg Val Val Asn Gly Ser lie Ser Lys Ser Asp Leu Asp Gly Phe 
85 90 95 

lie Gly Ser Tyr Leu Ser Ser Pro Asp Lys Asp Leu Val Tyr Val Glu 
100 105 110 

Pro Met Asp Phe Val Ala Glu Pro Glu Gly Phe Leu Pro Lys Val Lys 
115 120 125 

Asn Ser Glu Val Arg Ala Trp Ala Leu Glu Val His Ser Leu Trp Lys 
130 135 140 

Asn Leu Ser Arg Lys Val Ala Asp His Val Leu Glu Lys Pro Glu Leu 
145 150 155 160 

Tyr Thr Leu Leu Pro Leu Lys Asn Pro Val lie lie Pro Gly Ser Arg 
165 170 175 

Phe Lys Glu Val Tyr Tyr Trp Asp Ser Tyr Trp Val lie Arg Gly Leu 
180 185 190 

Leu Ala Ser Lys Met Tyr Glu Thr Ala Lys Gly lie Val Thr Asn Leu 
195 200 205 



Val Ser Leu lie Asp Gin Phe Gly Tyr Val Leu Asn Gly Ala Arg Ala 
210 215 220 

Tyr Tyr Ser Asn Arg Ser Gin Pro Pro Val Leu Ala Thr Met lie Val 
225 230 235 240 

Asp lie Phe Asn Gin Thr Gly Asp Leu Asn Leu Val Arg Arg Ser Leu 
245 250 255 

Pro Ala Leu Leu Lys Glu Asn His Phe Trp Asn Ser Gly lie His Lys 
260 265 270 

Val Thr lie Gin Asp Ala Gin Gly Ser Asn His Ser Leu Ser Arg Tyr 
275 280 285 

Tyr Ala Met Trp Asn Lys Pro Arg Pro Glu Ser Ser Thr lie Asp Ser 
290 295 300 

Glu Thr Ala Ser Val Leu Pro Asn lie Cys Glu Lys Arg Glu Leu Tyr 
305 310 315 320 

Arg Glu Leu Ala Ser Ala Ala Glu Ser Gly Trp Asp Phe Ser Ser Arg 
325 330 335 

Trp Met Ser Asn Gly Ser Asp Leu Thr Thr Thr Ser . Thr Thr Ser lie 
340 345 350 

Leu Pro Val Asp Leu Asn Ala Phe Leu Leu Lys Met Glu Leu Asp lie 
355 360 365 

Ala Phe Leu Ala Asn Leu Val Gly Glu Ser Ser Thr Ala Ser His Phe 
370 375 380 

Thr Glu Ala Ala Gin Asn Arg Gin Lys Ala lie Asn Cys lie Phe Trp 
385 390 395 400 

Asn Ala Glu Met Gly Gin Trp Leu Asp Tyr Trp Leu Thr Asn Ser Asp 
405 410 415 

Thr Ser Glu Asp lie Tyr Lys Trp Glu Asp Leu His Gin Asn Lys Lys 
420 425 430 

Ser Phe Ala Ser Asn Phe Val Pro Leu Trp Thr Glu lie Ser Cys Ser 
435 440 « 445 

Asp Asn Asn lie Thr Thr Gin Lys Val Val Gin Ser Leu Met Ser Ser 
450 455 460 

Gly Leu Leu Gin Pro Ala Gly lie Ala Met Thr Leu Ser Asn Thr Gly 
465 470 475 480 

Gin Gin Trp Asp Phe Pro Asn Gly Trp Pro Pro Leu Gin His lie lie 
485 490 495 



He Glu Gly Leu 
500 

Lys Asp He Ala 
515 

Lys Thr Gly Ala 
530 

Tyr Gly Gly Gly 
545 

Asn Gly Val Val 



Leu Lys He Asp 
580 



Leu Arg Ser Gly 



He Arg Trp Leu 
520 

Met Tyr Glu Lys 
535 

Gly Glu Tyr Met 
550 

Leu Ala Leu Leu 
565 

Cys 



Leu Glu Glu Ala 
505 

Arg Thr Asn Tyr 



Tyr Asp Val Thr 
540 

Ser Gin Thr Gly 
555 

Glu Glu Phe Gly 
570 



Arg Thr Leu Ala 
510 

Val Thr Tyr Lys 
525 

Lys Cys Gly Ala 



Phe Gly Trp Ser 
560 

Trp Pro Glu Asp 
575 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 

(A) NAME / KEY : modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: -11: 
GGYGGNMGMT TYRWNGARKT MTAYKRYTGG GAC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 

(A) NAME / KEY : modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME / KEY : modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif iedjoase 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME / KEY : modif ied_base 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GTNCCNGGNG GNCGNTTYRW NGARKT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 




(A) NAME /KEY : modif ied__base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME / KEY : modi f ied_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod_base= i 

( ix) FEATURE : 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GGNGGYTGNS WNCGNYRNAG RTARTA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE; 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /mod_base= i 



(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /mod_base= 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /mod_base= 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
NSCRTTNRYC CATCCRAANC CNTC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
CGAAACGGGC CCATCAATTA 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
TCGATGAGAT CAATGCCGAG 




(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
CCATCCTAAT ACGACTCACT ATAGGGC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
CACAACAGGC TGGTATCCCG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 19: 
CAATAACGAA CTGGGAAGCC 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
ACTCACTATA GGGCTCGAGC GGC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /mod_base= 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod_base= 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 21 
GAYNTNATNT GGRTNCAYGA YTAYCA 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME / KEY : modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod__base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION; /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCNACNGTRC ANGCRAANAC 
(2) INFORMATION FOR SEQ ID NO : 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(ix) FEATURE : 

(A) NAME /KEY : modif ied_base 

(B) LOCATION; 2 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE : 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 2 0 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 23 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TNGGNTKNTT YYTNCAYAYN CCNTTYCC 
(2) INFORMATION FOR SEQ ID NO: 24; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

* 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL : NO 



(ix) FEATURE: 

(A) NAME / KEY : modif ied_base 

(B) LOCATION: 6 

(D> OTHER INFORMATION: /mod_base:= i 



(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE : 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TGRTCNARNA RYTCYTTNGC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE : 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME / KEY : modif iedjaase 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME / KEY : modif ied_base 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CCRTGYTCNG CNSWNARNCC 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

( ix) FEATURE : 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TCRTCNGTRA ARTCRTCNCC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 15 



(D) OTHER INFORMATION: /mod_base= 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /mod_base= 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
GYNACNARRT TCATNCCRTC NC 



